AITopics | inner problem

inner problem

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

Dohyeong Kim

Neural Information Processing Systems, Feb-17-2026, 06:53:18 GMT

However, the nonlinearity of risk measures makes it challenging to achieve convergence and optimality.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)

Thompson Sampling with Information Relaxation Penalties

Neural Information Processing Systems, Feb-14-2026, 19:58:27 GMT

We consider a finite-horizon multi-armed bandit (MAB) problem in a Bayesian setting, for which we propose an information relaxation sampling framework.

artificial intelligence, data mining, irs, (17 more...)

Neural Information Processing Systems

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.69)
Information Technology > Data Science > Data Mining > Big Data (0.34)

Distributed Inverse Constrained Reinforcement Learning for Multi-agent Systems

Neural Information Processing Systems, Feb-12-2026, 05:20:06 GMT

We formally guarantee that the distributed learners asymptotically achieve consensus which belongs to the set of stationary points of the bi-level optimization problem.

artificial intelligence, machine learning, reinforcement learning, (14 more...)

Neural Information Processing Systems

Country: North America > United States (0.14)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

1c32452f112719f7c1db6d983d060f78-Paper-Conference.pdf

Neural Information Processing Systems, Feb-8-2026, 22:28:42 GMT

experiment, machine learning, natural language, (20 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

A Saddle Point Remedy: Power of Variable Elimination in Non-convex Optimization

Gan, Min, Chen, Guang-Yong, Yi, Yang, Yang, Lin

arXiv.org Machine Learning, Nov-4-2025

The proliferation of saddle points, rather than poor local minima, is increasingly understood to be a primary obstacle in large-scale non-convex optimization for machine learning. Variable elimination algorithms, like Variable Projection (VarPro), have long been observed to exhibit superior convergence and robustness in practice, yet a principled understanding of why they so effectively navigate these complex energy landscapes has remained elusive. In this work, we provide a rigorous geometric explanation by comparing the optimization landscapes of the original and reduced formulations. Through a rigorous analysis based on Hessian inertia and the Schur complement, we prove that variable elimination fundamentally reshapes the critical point structure of the objective function, revealing that local maxima in the reduced landscape are created from, and correspond directly to, saddle points in the original formulation. Our findings are illustrated on the canonical problem of non-convex matrix factorization, visualized directly on two-parameter neural networks, and finally validated in training deep Residual Networks, where our approach yields dramatic improvements in stability and convergence to superior minima. This work goes beyond explaining an existing method; it establishes landscape simplification via saddle point transformation as a powerful principle that can guide the design of a new generation of more robust and efficient optimization algorithms.
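The saddle-to-maximum correspondence described in this abstract can be checked by hand on a toy problem. The sketch below is not taken from the paper: the objective f(x, y) = (xy - 1)^2 + lam * y^2, the weight `LAM`, and all function names are assumptions chosen for illustration. Eliminating y in closed form gives the reduced objective g(x) = lam / (x^2 + lam). The origin is a saddle of f, and the Schur complement of the Hessian there equals the curvature of g at x = 0, which is negative, so the same point becomes a local maximum of the reduced landscape.

```python
import numpy as np

LAM = 0.5  # hypothetical regularization weight, chosen only for this toy example


def f(x, y, lam=LAM):
    """Toy scalar 'factorization' objective (an assumption, not from the paper)."""
    return (x * y - 1.0) ** 2 + lam * y ** 2


def hessian_at_origin(lam=LAM):
    """Analytic Hessian of f at the critical point (0, 0).

    f_xx = 2y^2 = 0, f_xy = 4xy - 2 = -2, f_yy = 2x^2 + 2*lam = 2*lam.
    """
    return np.array([[0.0, -2.0], [-2.0, 2.0 * lam]])


def y_star(x, lam=LAM):
    """Closed-form minimizer of f over y for fixed x (the eliminated variable)."""
    return x / (x ** 2 + lam)


def reduced(x, lam=LAM):
    """Reduced objective g(x) = min_y f(x, y) = lam / (x^2 + lam)."""
    return f(x, y_star(x, lam), lam)


def schur_complement(H):
    """H_xx - H_xy * H_yy^{-1} * H_yx for a 2x2 Hessian."""
    return H[0, 0] - H[0, 1] * H[1, 0] / H[1, 1]


def second_derivative(g, x, h=1e-4):
    """Central finite-difference estimate of g''(x)."""
    return (g(x + h) - 2.0 * g(x) + g(x - h)) / h ** 2


if __name__ == "__main__":
    H = hessian_at_origin()
    print("Hessian eigenvalues at (0, 0):", np.linalg.eigvalsh(H))
    print("Schur complement:", schur_complement(H))
    print("Finite-difference g''(0):", second_derivative(reduced, 0.0))
```

The Hessian has one negative and one positive eigenvalue (a saddle of f), while the Schur complement is -2 / lam, matching the negative curvature of g at x = 0. The example only illustrates the mechanism on one function; it makes no claim about the paper's general theorem or its experiments.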

artificial intelligence, machine learning, saddle point, (18 more...)

arXiv.org Machine Learning

2511.01234

Country:

Asia > China (0.46)
North America (0.46)
Europe (0.46)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.88)

Spectral-Risk Safe Reinforcement Learning with Convergence Guarantees

Dohyeong Kim

Neural Information Processing Systems, Oct-10-2025, 12:34:37 GMT

However, the nonlinearity of risk measures makes it challenging to achieve convergence and optimality.

algorithm, constraint, risk measure, (15 more...)

Neural Information Processing Systems

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report > Experimental Study (0.93)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.65)

Collaborative Learning via Bilevel Optimization

Neural Information Processing Systems, Oct-9-2025, 20:12:08 GMT

Identifying helpful clients, however, presents challenges and often introduces significant overhead.

experiment, federated learning, learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > Virginia (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Communications (0.83)
Information Technology > Artificial Intelligence > Natural Language (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Thompson Sampling with Information Relaxation Penalties

Seungki Min, Costis Maglaras, Ciamac C. Moallemi (Columbia Business School)

Neural Information Processing Systems, Aug-20-2025, 07:27:44 GMT

We consider a finite-horizon multi-armed bandit (MAB) problem in a Bayesian setting, for which we propose an information relaxation sampling framework.

inner problem, penalty function, relaxation, (13 more...)

Neural Information Processing Systems

Country: North America > Canada (0.04)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)
Information Technology > Data Science > Data Mining (0.66)

Distributed Inverse Constrained Reinforcement Learning for Multi-agent Systems

Neural Information Processing Systems, Aug-19-2025, 07:49:35 GMT

This paper considers the problem of recovering the policies of multiple interacting experts by estimating their reward functions and constraints where the demonstration data of the experts is distributed to a group of learners. We formulate this problem as a distributed bi-level optimization problem and propose a novel bi-level "distributed inverse constrained reinforcement learning" (D-ICRL) algorithm that allows the learners to collaboratively estimate the constraints in the outer loop and learn the corresponding policies and reward functions in the inner loop from the distributed demonstrations through intermittent communications. We formally guarantee that the distributed learners asymptotically achieve consensus which belongs to the set of stationary points of the bi-level optimization problem. Simulations are done to validate the proposed algorithm.

artificial intelligence, constraint, machine learning, (13 more...)

Neural Information Processing Systems

Country: North America > United States > Pennsylvania (0.04)

Industry: Information Technology (0.46)

Technology: